Pronunciation variants description using recognition error modeling with phonetic derivation hypotheses
نویسندگان
چکیده
This paper proposes a new method of pronunciation variant generation for reducing word error rate in conversational speech recognition. In particular, this paper focuses on the generation of alternative pronunciations from canonical forms by using the phonological knowledge derived from the analysis of a phonetic transcription corpus. The experimental results show that the pronunciation variation generated by the proposed method provides slightly better performance than a method based on manually written pronunciation. These results also demonstrate the applicability of phonological knowledge-based generation of pronunciation variation.
منابع مشابه
Generating proper name pro for automatic speech
Generating correct pronunciation of proper names remains one of the most difficult tasks in text-to-phoneme transcription. Although phonetic rules can be efficient in processing proper names of one language, foreign family names cannot be always correctly generated without additional pronunciation rules. The present study addresses the problem of pronunciation variants for French and foreign fa...
متن کاملUnsupervised Pronunciation Adaptation for Off-line Transcription of Japanese Lecture Speeches
Observing that most variations in pronunciation are strongly speaker and speaking style dependent, and that the introduction of pronunciation variants in a speaker-independent recognition system is of limited success, we refrain from applying multiple pronunciation variants in the speakerindependent case and instead introduce pronunciation variants without supervision when specializing the reco...
متن کاملA study of implicit and explicit modeling of coarticulation and pronunciation variation
In this paper, we focus on the modeling of coarticulation and pronunciation variation in Automatic Speech Recognition systems (ASR). Most ASR systems explicitly describe these production phenomena through context-dependent phoneme models and multiple pronunciation lexicons. Here, we explore the potential benefit of using feature spaces covering longer time segments in terms of implicit modeling...
متن کاملLarge vocabulary continuous speech recognition based on cross-morpheme phonetic information
In this paper, we present a novel method to regulate lexical connections among morpheme-based pronunciation lexicons for Korean large vocabulary continuous speech recognition (LVCSR) systems. A pronunciation dictionary plays an important role in subword-based LVCSR in that pronunciation variations such as coarticulation will deteriorate the performance of an LVCSR system if it is not well accou...
متن کاملModeling Pronunciation Variation in Conversational Speech using Syntax and Discourse
A significant source of variation in spontaneous speech is due to intra-speaker pronunciation changes. Previous work in automatic speech recognition has identified several factors that affect pronunciation variability such as phonetic context and speaking rate. This work examines new higher level information sources: syntax and discourse structure, specifically the relationship between these fa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000